Towards Pose Estimation of 3D Objects in Monocular Images via Keypoint Detection
نویسنده
چکیده
The availability of a large number of crowdsourced CAD models of objects can be leveraged to solve the problem of pose estimation of 3D objects in monocular images. Convolutional Neural Networks(CNNs) perform to the best of their capability when they have been trained on a large amount of labeled data. We explore how 3D models can be used to generate lots of training images and annotations in the form of keypoint locations. We propose to use CNNs to first detect keypoints in rendered images. Once, we have a correspondence between 2D points in a test image and the 3D points on the CAD model, we can align 3D models in 2D images.
منابع مشابه
3D Human Pose Estimation in RGBD Images for Robotic Task Learning
We propose an approach to estimate 3D human pose in real world units from a single RGBD image and show that it exceeds performance of monocular 3D pose estimation approaches from color as well as pose estimation exclusively from depth. Our approach builds on robust human keypoint detectors for color images and incorporates depth for lifting into 3D. We combine the system with our learning from ...
متن کاملCamera Pose Estimation in Unknown Environments using a Sequence of Wide-Baseline Monocular Images
In this paper, a feature-based technique for the camera pose estimation in a sequence of wide-baseline images has been proposed. Camera pose estimation is an important issue in many computer vision and robotics applications, such as, augmented reality and visual SLAM. The proposed method can track captured images taken by hand-held camera in room-sized workspaces with maximum scene depth of 3-4...
متن کاملDetection of 3D objects with a multiple-view keypoint model from item images
We present a keypoint based multiple-view model approach towards 3D object detection. Our model addresses the shortcomings of other keypoint based detection methods, since they do not take into account the 3D nature of everyday objects and thus rely on frontal object occurrences. For each 3D object, a model is learned from multiple view images of the object. The learned model is used to perform...
متن کاملShape Priors for Monocular Object Localization in Dynamic Scenes
We tackle the problem of reconstructing moving vehicles in autonomous driving scenarios using only a monocular camera. Though the problem appears to be ill-posed, we demonstrate that prior knowledge about how 3D shapes of vehicles project to an image can be used to reason about the reverse process, i.e., how shapes (back-)project from 2D to 3D. We encode this knowledge in shape priors, which ar...
متن کامل3D Face pose estimation and tracking from a monocular camera
In this paper, we describe a new approach for estimating and tracking three-dimensional (3D) pose of a human face from the face images obtained from a single monocular view with full perspective projection. We assume that the shape of a 3D face can be approximated by an ellipse and that the aspect ratio of 3D face ellipse is given. Given a monocular image of a face, we ®rst perform an ellipse d...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016